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Abstract 

We study a problem of finding an optimal stopping strategy to liquidate an asset 
with unknown drift. Taking a Bayesian approach, we model the initial beliefs of an 
individual about the drift parameter by allowing an arbitrary probability distribution 
to characterise the uncertainty about the drift parameter. Filtering theory is used to 
describe the evolution of the posterior beliefs about the drift once the price process 
is being observed. An optimal stopping time is determined as the first passage time 
of the posterior mean below a monotone boundary, which can be characterised as 
the unique solution to a non-linear integral equation. We also study monotonicity 
properties with respect to the prior distribution and the asset volatility. 


1 Introduction 


It is an inevitable feature of human economic activity that prices of goods vary in time. 
Thus, naturally, a person participating in trade cares much about the best time to perform 
a transaction. Let us think about an individual who possesses an indivisible asset with 
price evolution {St}t>o and wants to sell it before time T > 0. Assuming a liquid market, 
how should the seller choose a selling time to maximise his/her profit from the sale? 
Mathematically, the question is about finding a stopping time t* , belonging to a set of 
admissible stopping times Tt, such that 


E[Sr*] = sup E[Sr]. (1.1) 

tSTt 

A natural set of admissible stopping times Tr to consider is the set Trp of stopping times 
with respect to the price process S, i.e. at any point in time, the decision whether to sell 
the asset or not must be based solely on the price history of S. Throughout this article 
we assume Tt = ■ 

In the context of the classical Black-Scholes model 


dSt = aSt dt + a St dWt , 


( 1 . 2 ) 


where a, a are known constant parameters, the answer to the 
(1.1) is straightforward: if a > 0, then the optimal strategy is to 
T; if a < 0, then the optimal strategy is to sell immediately, i.e. 
any stopping time r is optimal. 


optimal selling question 
sell at the terminal time 
at time 0; if a = 0, then 
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However, in applications, the known constant drift assumption of the Black-Scholes 
model is usually too strong. To obtain reasonable precision when estimating the drift one 
needs very long time-series, which are rarely available. An extreme example of the lack of 
data is a stock of an initial public offering (IPO) for which the price history simply does 
not exist. Furthermore, even in those few cases where enough past data is available, the 
benefit of accurate calibration of the historical drift is most likely to be outweighed by the 
model risk introduced. This is because most financial models, including Black-Scholes, 
are only plausible as short-term models; the simplistic constant parameter assumptions 
are non-viable over longer time periods. On the other hand, the assumption of known 
volatility parameter a is justifiable as it can be estimated, at least in theory, from an 
arbitrarily short observation period. 

Though the notoriety of the drift estimation pushed much of financial mathematics 
literature to focus on questions where the drift parameter can be avoided or at least does 
not play a crucial role (e.g. risk-neutral pricing and hedging), in the optimal liquidation 
problem, the drift can have a noticeable effect. Figure containing the estimated Black- 
Scholes model parameters of a few famous IPOs over the first year since going public, 
suggests that it is unlikely that the price change in all these cases was due to the volatility 
alone, leading us to believe in the significance of the drift contribution that needs to be 
addressed. 


IPO 

log(5i/S’o) 

a 

(T 

Amazon (1997) 

1.34 

1.68 

0.83 

Google (2004) 

1.03 

1.11 

0.41 

Facebook (2012) 

-0.42 

-0.27 

0.55 

Vonage (2006) 

-1.53 

-1.29 

0.70 


Figure 1: The estimates a and a of the drift a and the volatility a are calculated over the 
first year of an IPO using the daily closing prices. Data source: Google Finance. 


Admission that due to unattainable calibration, in many situations, modelling a price 
by a geometric Brownian motion with a known constant drift is ill-suited is not a reason to 
give up modelling, but a mere indication that the model should be improved to incorporate 
extra factors. As the exact value of the drift parameter is unknown, we choose to model 
the inherent uncertainty about the drift by a probability distribution. More precisely, we 
extend the geometric Brownian motion model (1.2) by replacing the constant drift a by 
a random variable whose distribution (called ‘a prior’ in Bayesian statistics) incapsulates 
all the knowledge available to us concerning the uncertainty about the drift. As far as the 
volatility a is concerned, we stay with the known constant volatility assumption. 

A potential practical application of this drift uncertainty modelling is in the optimal 
liquidation of an IPO share. A person possessing a share of an IPO has only beliefs about 
the drift of the price process as no past price data is available to calibrate the model. 
Though in this article we view the prior distribution as subjective beliefs whose origin we 
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do not question, one can also think of transparent constructive approaches for choosing 
a prior. A possible approach in the IPO example is to use the empirical distribution of 
the returns of similar IPOs over the initial period of the same length as our investment 
horizon T. The similarity criteria could be the market sector, the country, the market 
share, etc. 

For an agent interested in optimally liquidating only the idiosyncratic (i.e. stock- 
specific) component of a stock price, the sequential procedure is even more beneficial due 
to reduced volatility (relatively high volatility in Figure could be seen as diminishing 
the advantage of the sequential liquidation procedure). A simple structural Black-Scholes 
model including an idiosyncratic and a market factor (e.g. see a) suggests that the id¬ 
iosyncratic price component 

St/It oc 


Here It is a large-basket index representing the market factor, , W' denote the idiosyn¬ 
cratic drift, the idiosyncratic volatility, and the idiosyncratic random driver, respectively. 
As fj' < cr, learning about the idiosyncratic drift is faster than learning about the total 
drift /r, so applying the sequential procedure in such a situation is even more advantageous 
than in the standard case. 


In this article, we solve the optimal liquidation problem (1.1) within the proposed 
model under an arbitrary prior distribution for the drift. The first time the posterior 
mean of the drift passes below a specific non-decreasing curve is shown to be optimal; the 
stopping boundary is characterised as the unique solution of a particular integral equation. 

To include more details, our investigation of the optimal strategy can be briefly de¬ 
scribed in the following. The original problem with incomplete information about the 
drift is reformulated as a complete information problem by projecting the price evolution 
onto the observable filtration using filtering theory. The mean of the posterior distribu¬ 
tion becomes the underlying process of a new equivalent optimal stopping problem with a 
stochastic killing/creation rate and a constant payoff function. This conditional mean is 
shown to satisfy a stochastic differential equation driven by the innovation process. The 
dispersion coefficient of the SDE is proved to be decaying in time as well as satisfy a special 
condition on the second spatial derivative. Embedding the value function into a Markovian 
framework and making a suitable connection with the term-structure equation, the estab¬ 
lished dispersion function properties enable us to employ the available convexity results 
to prove convexity of the Markovian value function in the spatial variable. Moreover, the 
value function is shown to be continuous and decreasing in time. These significant facts 
allow us to show that the first passage time below a monotone boundary is an optimal 
stopping time, so techniques from the theory of free-boundary problems with monotone 
boundaries can be applied. Specifically, the monotonicity of the boundary enables us to 
prove the smooth-fit property and to investigate the corresponding integral equation. The 
optimal stopping boundary is characterised as the unique non-positive and continuous 
solution to a non-linear integral equation. 
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Besides the examination of the optimal strategy, we investigate monotonicity prop¬ 
erties of the expected optimal liquidation value with respect to the asset volatility and 
the prior distribution. Notwithstanding that all-inclusive theorems about parameter de¬ 
pendence appear currently to be beyond reach, we derive some sufficient conditions for 
monotonicity in the volatility a as well as the prior distribution. In addition, we conduct 
numerical experiments in the case of the normal prior; some results reinforce standard in¬ 
tuition, others illustrate inherent subtleties. In particular, additional value that an optimal 
strategy involving filtering brings over an optimal strategy without filtering is calculated, 
exhibiting an improvement of up to 10% for some feasible parameter regimes. 

As far as extensions of this work are concerned, solving the optimal liquidation prob¬ 
lem for more general diffusions, with possibly time- and level-dependent coefficients, is 
more problematic. Such extensions typically lead to the loss of the useful one-dimensional 
Markovian structure present in the classical geometric Brownian motion setting; an op¬ 
timal decision then depends on the whole price trajectory rather than the current spot 
price alone. Clearly, a complete treatment of the resulting optimal stopping problem is 
much more complicated. 

1.1 Literature review 

Over the last three decades, investment problems with incomplete information about 
the drift has received much attention from both financial mathematicians and financial 
economists. Some of the most distinct works on portfolio optimisation include [6] , regarded 
as the first incomplete information problem studied in financial literature, and the general 
portfolio problems studied in da HE]; see also the recent article [3] proposing a general 
framework for most of the earlier works as well as containing an excellent survey with 
references. Hedging in an incomplete market under partial information about a constant 
drift was addressed in m in the case of the Kalman-Bucy filter. In addition, incomplete 
information models have been investigated in the financial economics literature (see the 
survey paper [2] as well as the monograph m)- 

In contrast, there have been surprisingly few attempts to tackle financial optimal 
stopping problems under incomplete information such as the optimal liquidation problem 
above, with the existing works focusing mainly on a very restrictive case, namely, the two- 
point prior. The optimal liquidation of an asset with unknown drift has been studied in [8] 
and of an asset with unknown jump intensity in |16) . For option valuation problems under 
incomplete information, see [5] and m- The financial optimal stopping articles above 
typically assume a two-point prior distribution; having in mind that the prior represents 
the beliefs about all the different values the parameter could possibly take, the two-point 
prior strikes as a simplistic and unrealistic assumption. Overcoming this assumption is one 
of the main contributions of the present article. It is also worth mentioning that various 
different formulations of the optimal selling problem in the case of complete information 
about the parameters have been studied in m, and |12j . 
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2 The model and problem formulation 


We consider a financial market living on a stochastic basis F, P), where the filtra¬ 

tion F = {Tt}t>o satishes the usual conditions and the measure F denotes the physical 
probability measure. The basis supports a Brownian motion W and a random variable 
X such that W and X are independent. We assume that the observed price process S 
evolves according to 


dSt = XStdt + aStdWt, ( 2 . 1 ) 

where the volatility u > 0 is a known constant. We write F'^ = for the filtration 

generated by the price process S and augmented by the null sets of T. In this article, F'^ 
corresponds to the only available source of information, i.e. an agent can only observe the 
price process S, but not the random driver W or the drift X. The distribution of X, which 
we denote by n, represents the subjective beliefs of the individual about the likeliness of 
the different values the mean return rate X may take. 

The optimal selling problem that we are interested in is 

y=supE[5^], (2.2) 

where Trp denotes the set of F'^-stopping times that are less or equal to a specified time 
horizon T > 0 . 

Note that if the support of /r is contained in [ 0 , 00 ), then an optimal strategy is to 
stop at the terminal time T. Similarly, if the support of /x is contained in (—oo,0], then 
an optimal strategy is to stop immediately. To exclude these trivial cases, we from now 
on impose the assumption that /u((— 00 , 0 )) 7 ^ 0 and /x(( 0 , 00 )) / 0 . 

Remark The inclusion of a constant discount rate r > 0 is straightforward. Indeed, the 
discounted price St := e~^^St satisfies 

dSt = (X -r)Stdt + aStdWt, (2.3) 

and so the optimal stopping problem 


sup E[e 


rer/ 


reduces to ( 2 . 2 ) but with the prior distribution replaced by //(• -|- r). 


2.1 Equivalent reformulation under a measure change 

Assuming that fj, has a hrst moment, Xt := E[A | Xt] exists, and the process 

1 /■* 

Wf.= - {X-Xs)ds + Wt, 

^ Jo 
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known as the innovation process, is an F'^-Brownian motion (see [H Proposition 2.30 on 
p. 33]). Writing F'^ = {jf }t>o for the completion of the filtration {a{Ws : 0 < s < t)}t>o, 
we note that F'^ = F'^ (see the remark on p. 35 in [T]). 

Defining a change of measure by the random variable 


and writing 





we have 


E[Sr] =E^ 


Jo ^sds 


where r G 7^. Without loss of generality, we assume 5o = 1 throughout the article; the 
optimal stopping problem (2.2) then becomes 


V = sup 


rer® 


(2.4) 


We also note that, by Girsanov’s theorem, the process Zf := —at + Wt is a Q-Brownian 
motion on [0, T]. 


3 Analysis of the optimal stopping problem 


3.1 Projecting onto the observable filtration 

Let us introduce Yt := Xt + aWt so that St = Clearly, the processes Y 

and S generate the same filtrations. The following proposition describes the conditional 
distribution of X given observations of the stock price in terms of the current value of the 
process Y. For its proof, see Proposition 3.16 in [T]. 

Proposition 3.1. Let q ; M —)> M satisfy \q{u)\fj,{du) < oo. Then 

2uYf—v?‘t 

E jiz-f ] = E 1<1{X)\Y,] = LYJYXXYI 

/r e 2 .^ fi{du) 

for any t >0. 


by 


By Proposition 3.1 


the distribution nt,y of X at time t conditional on 17 = y is given 


Tt,y{du) 


2uy — u^t 

e y{du) 


2uy — u^t 


y{du) 


(3.1) 
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and 


(3.2) 


Xt = E[X I ] = E [X\Yt] = fit, Yt) 


for any f > 0, where 


f{t,y)= / unt,yidu) = 


2uy — u‘^t 

Jr 

2uy — u'^t 

f^e 2.^ /j(du} 


As a shorthand, we denote by Ef^y the expectation operator under the probability measure 
P,J.):=P(.lY = y). 

From now onwards, the following integrability condition on /r is imposed. 


Assumption 3.2. The prior distribution /x satisfies 


I 

Jr 


/x(du) < oo 


(3.3) 


for some a > 0. 

This assumption is an insignificant restriction on our optimal liquidation problem, 
since, given any probability distribution /x, the distributions p,t,y in (3.1) satisfy (3.3) for 
any t > 0. The main benefit of Assumption |3 . 2| is that it allows us to extend the definition 
of fit,y in (3.1) to f = 0. Indeed, suppose that fj, satisfies (3.3) with a = ejiflo'^^ for some 
e > 0 . Defining a probability distribution ^ on M by 


^(du):= 


eYY pidu) 

~ d? ’' 

Jj^e^fx(du) 


(3.4) 


the measure 


coincides with 


fj.o,y((iu) = 


eJ^ /x(dn) 

uy 


6,3/(dn) := 


2uy — u^e 

e 2^^ 6d^i) 


Jr ® 


2uy — u^e 


^idu) 


Consequently, the distribution can be identified with a conditional distribution at 
time 0 given that the prior distribution at time — e was and the current value of the 
observation process is y. This gives us a generalisation of the notion of the starting point 
of the observation process Y to allow Yq = y 7^ 0, so we may regard time 0 as an interior 
point of the time interval. 

Next, we establish a bijective correspondence between St and Xt. For it, let ly de¬ 
note the interior of the smallest closed interval containing the support of y, i.e. ly = 
(inf(supp(/i)),sup(supp(/i))). 
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Lemma 3.3. For any given t > 0, the function f{t, •) : M —defined above is a strictly 
increasing continuous bijection. 


Proof. Thanks to Assumption 3.2, it suffices to prove the claim only for t = 0. Differen¬ 
tiation of / under the integral sign yields d 2 f{ 0 ,y) = — Eo,j/[X]^), which is 

strictly positive and finite for all y G M. As a result, y i—)• /(O, y) is strictly increasing. For 
surjectivity, we need that /(0,y) —>■ sup/^ as y —)• oo and /(0,y) —)• infas y —)• —oo. 
We only prove the first claim as the second one then follows immediately by symmetry. 

Let 9 £ I^n (0, oo), y > 0, and consider 

/ uy I uy I uy Oy I wy 

/ ue^fj.{du) — 6 / e^fi{du) = / (u — 9)e^^{du) = / we^y{9 + dw), (3.5) 

t/M M t/M •.'K. 


where w := u — 9. As the minimum of tc i—)• we^^ is attained at w = —a^jy, we have 




/ we^fi{9 + dw) > — 

' (— 00 , 0 ] y 


a 


/ y{9 + drc) >-. 

'(—oo,o] y 


Furthermore, 


/ luy 

/ weiPfi{9 + dw) ^ oo 

J (0,oo) 


as y —)• oo by monotone convergence. Consequently, from (3.5) follows that f{0,y) > 9 
for all large enough y. Since 9 £ was arbitrary, we conclude that /(0,y) —>• sup/^ as 
y —7- oo, which finishes the proof. □ 

Writing E^ = {T^}t>Q for the completion of the filtration generated by X and writing 
Tr^ for the set of E^-stopping times not exceeding T, we formulate the following immediate 
corollary. 

Corollary 3.4. E"^ = E^ and Tip = Tp . 


A consequence of this corollary is that the optimal stopping problem (2.4) can be rewritten 


as 


F = sup E®[e^o"Wd.j^ 

rer/ 

Looking for a more tractable characterisation of A, we find an SDE representation of X 
with respect to the observations filtration E'^. An application of Ito’s formula to Xt = 
f{t,Yt) yields 

dXt = c7d2f{t,Yt)dWt. (3.7) 

Introducing the notation fi := f{t, ■), we define 


fi{t,x) := ad 2 f{t,fi ^(x)) 






and, from (3.7) above, obtain a stochastic differential equation 

dXt = i^{t,Xt)dWt 


for the conditional mean Xt. Rewriting of the equation in terms of the Q-Brownian motion 
Zt = —at + Wt results in 


dXt = aiPit, Xt) dt + Xt) dZt. 


(3.8) 


The dispersion V’ can be expressed more explicitly (by differentiating / under the integral 
sign) as 

V’(t,x) = i {Et^y^^t)[X^]-Et^y^(^t)[X]‘^) = ^Vari,y^(t)(X), 

where the notation yx{t) := ff^{x) is used (note that yxit) is the unique value of the 
observation process Yt that yields Xt = x). 


Example (The two-point prior) Suppose /r = ndh + (1 — TT)di, where 6i,6h denote the 
Dirac measures at /, G M respectively. Then x) = d(^h — x){x — 1). 


Example (The normal prior) Suppose // is the normal distribution with mean m and 

variance 7 ^. Then the conditional distribution P(-|yt = y) = y‘t,y is also normal but with 
2 2 2 2 2 
mean and variance • Consequently, ^{t,x) = ■ 

3.2 Dispersion of the conditional mean 

The following inequality will be the key to understanding the dispersion function ?/). 
Proposition 3.5. Let X he a random variable with E[X^] < 00 . Then 

E[X^]E[x2] + 2E[x3]E[x2]E[X] - E[X‘^]E[X]2 - E[x3]2 - E[x2]3 > 0 

with the equality if and only if X has a one-point or a two-point distribution. 

Proof. Let X, Y, Z be independent and identically distributed random variables with 
E[X^] < 00 . Observe that 

E[(X - Y)^(Y - Z)^(Z - X)2] 

= E[x*{Y^ + z^) + y^(z^ + x^) + z^(x^ + y^)] 

-2E[X‘^yy + y^yx + z '^ xy ] 

+2E[X^{Y‘^Z + Z'^Y) + Y^{Z‘^X + X^Z) + Z^{X^Y + y^x)] 

-2E[X^Z^ + Y^X^ + _ qe[X^Y^Z^] 

= 6(E[X^]E[x2] - E[X'^]E[X]2 + 2E[x3]E[x2]E[X] - E[X3]2 - E[X^f), 

where the last equality holds because X, Y, Z are i.i.d. It is clear that 

E[(x - y)2(y - z)2(z - x)2] > o 

with the equality if and only if X has a one-point or a two-point distribution. This finishes 
the proof of the claim. □ 
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Remark We are grateful to Johan Tysk for providing an alternative proof of the above 
proposition based on the Pythagorean theorem in spaces. 

Proposition 3.6 (Properties of the dispersion function il:!). 

1. For any x G I^, the function t i—>■ f^{t,x) is non-increasing. It is strictly decreasing 
unless is a two-point distribution, in which case t !->■ xf^t, x) is a constant. 

2. ^ “f ® strict inequality unless g, is a two-point distribution, in which case 
we have equality. 

3. If fi is compactly supported, then ip is bounded. 

Proof. 1. Recall the notation yx{t) = f^^{x), and consider 





Implicit differentiation using the identity x = f{t,yx{t)) gives that 


2 


which substituted into the last expression above yields 


diip{t,x) 


- Var,,,^(,)(X^) Var,,,^(,)(X) 
2a3Var,,,^(,)(X) 






Now, the claim follows from Proposition |3.5| applied to the term between the paren¬ 
theses. 
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2. By the chain rule applied to the definition of V’j we have 

x) = yx{t))d 2 y{t, x), 

where y{t,x) := yxit)- Here 

1 d 


= ^ (Ei,,[X3] - mt,y[X^]Et,y[X] + 2Et,y[Xf) 
by straightforward differentiation under the integral sign, and 


d2y{t,x) = 


1 


a 


d 2 f{t,y{t,x)) Xavt^y^(^t){X)' 


by implicit differentiation. Hence 


d 2 'ip{t, x) = - 

G \ 

It remains to establish the inequality 


1 f 


Vart,y4t)(X) 


- 2x 


d2'4’{t,x) + - = - — 
a a oy 


1 d fEt,y[X^]-Et,y[X^]Et,y[X] 


Vart,y(X) 


d2y{t,x) > 0. 


y=yx{t) 

As d 2 y > 0, equivalently, it suffices to prove the non-negativity of 


q{t,y) := Xait,y{X) — 


2 5 (Et^y[X^]-Et,y[X^]Et^y[X] 


Vart,y(X) 


= {Et,y[X^])Xar,,y{X)-E,,y[X^]-^ (Vari,,X) 

Vari,,(X) + Et,,[x2]Ei,,[X]-Vari,,(X). 


Further differentiation yields that 


1 




q{t,y) = Et,y[X^]Et,y[X^] + 2Et,y[X-^]EtjX^]Et,y[X] 


-Et,y[X^]Et,y[Xr - EtjX^f - EtjX'^] 


-2i3 


Thus, by Proposition<? > 0; moreover, q > 0 for all priors ^ except the two-point 
distribution in which case q = 0. 

3. The well-known identity 


^t,y^it)[\^\‘^] = 2 / wEt,y^(i)(|A| > n)du 

J [0,oo) 

ensures that 'll; is bounded for compactly supported distributions. 


□ 
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Remark 

1. It is possible to come up with a contrived example of a prior distribution for which 
the dispersion is unbounded. For this, think of a discrete probability measure 
supported on an infinite number of points xi < X 2 < ■ ■ ■ < Xn < ■ ■ ■ such that 
Xn — Xn-i —)> oo as n —>■ oo. Using the notation Xn '■= (xn-i + Xn)l2 for the mean 
between neighbouring points, the value Xn) = Uart^s^(X)/cj —>■ oo as n —>■ oo by 
comparison with a two-point distribution concentrated at the points Xn-i and Xn- 

2. Let us stress that compact support of the prior is by no means a necessary condition 
for the boundedness of V'- For instance, we know that is bounded in the case of 
a normal prior as seen in the example on page Though a rigorous investigation 
into precise technical conditions on the prior for the boundedness of ijj appears to be 
involved enough to be omitted in this article, we conjecture, based on numerical in¬ 
vestigations, that V' is bounded for any prior admitting a density that monotonically 
approaches zero outside a large enough finite-length interval around the origin. 

As the boundedness of appears to be satisfied by any conceivable prior of interest in 
practical applications, we make it an assumption in the rest of the article. 

Assumption 3.7. The prior distribution // is such that Varo,j;(A) < oo for all y G M. 


3.3 The Markovian value function and the optimal strategy 


into a Markovian framework. To do that, define 


Using the dynamics ( |3.8[ ) of X, we are able to embed the optimal stopping problem (3.6) 

{t,x) E [0,r] X If,, 


v{t, x) = sup 
rSTr-t 




(3.9) 


where the process X = X^’^ is given by 

f dXt+s = + s, Xt+s) ds -h -b s, Xt+s) dZt+s {s > 0), 

\ Xt = x, 

and Tr-i denotes the set of stopping times less or equal to T — t with respect to the 
completed filtration of {Aj+s}s>o- 
Let us define the sets 


C = {(t,x) E [0,r] X If, : v{t,x) > 1} 

and 

V = {{t, x) E [0, T]x If,-. v{t, x) = 1}, 

which we will soon show to correspond respectively to continuation and stopping sets of 
an optimal strategy. Note that 

V > 1 (3.10) 
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(3.11) 


everywhere and that v{T, x) = 1, so C U H = [0, T] x and the random time 

Tj) := inf{s > 0 : (t + s, G V} 

satishes tx> <T — t. 

Proposition 3.8 (Optimal stopping time). The value function v is finite, and the time 
Tx> defined in (3.11) is an optimal stopping time. 

Proof. Without loss of generality, assume that t = 0, and let x G M. By Theorem D.12 in 
m, to prove the claims it suffices to show that 


sup exp 
0<t<T 


X^’^ds 


< oo. 


By the Dambis-Dubins-Schwartz theorem, there exists (possibly on a larger probability 
space) a Brownian motion B such that 


fi{s, X^’"") dZs - Bj-t .0(4 


/o 


If m > 0 is a constant dominating if, then 
rt 


sup exp 
0<t<T 


Xg’^ ds ) < exp ( T sup X 


'0 


'rO,X 


0<t<T 


< exp (r (x + amT + ^sup^ B ) 


< exp ( T ( X + amT + sup Bt 

0<t<m^T 


Thus 


sup exp 
0<t<T 


x°’^ds 


< E*® exp (T(x + umT + sup Bt)) 

0<t<m'^T 

= exp (T (x + amT)) E*® [exp (T|B^ 27 ^|)] 

< oo. 


where the equality comes from the reflection principle. □ 

Since V’ is continuously differentiable, it is Lipschitz continuous on any compact subset 
of [o,r] X I To avoid additional technical complications, from now on we impose a 
slightly stronger assumption of Lipschitz continuity on the whole of [0, T] x 

Assumption 3.9. The function if is Lipschitz continuous in the second variable, i.e. there 
exists K > 0 such that \if{t, x) — ip{t, y)\ < K\x — y\ for all t G [0, T] and all x, y G 

We remark that our canonical examples of the normal and the two-point prior both fulfill 
Assumption |3.9[ 
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Theorem 3.10 (Properties of the value function). 

1. The function x v(t,x) is non-decreasing and convex for any fixed t G [0, T], 

2. The function 1 1 — v{t, x) is non-increasing for any fixed x G 

3. The value function v is continuous on [0, T] x 

4- There exists a non-decreasing and continuous function h : [0, T] —)■ (—00,0] with 
h{T) = 0 such that C = {{t,x) G [0, T) x : x > h{t)}. 

5. The value function {t, x) i—)• v{t, x) solves the boundary value problem 


div + (T'i/’(t, x)d2V + \'ip{t, + XU = 0, X G C, 

V = 1, X G V. 


(3.12) 


Furthermore, the smooth-fit property holds in that the function x i— v{t,x) is for 
all t G [0, T]. 


Proof. 

1. (i) The monotonicity of x i— v{t,x) is clear from the representation 

v{t,x) = sup E*® 
reTr-t 


(3.13) 


of the value function together with a comparison theorem, see [241 Theo¬ 
rem IX.3.7]. 


(ii) Let us define VE{t,x) := 
where R = —X and so 


rT — t \rt,x 1 

gio ^t+s 


and UE{t, r) := 


rT — t fyt.r j 

=- Jo ^t+s ds 


dRt = -CF'tpit, -Rt) dt - ipit, -Rt) dZt. 

Then VE{t, x) = UE{t, —x). Now, the convexity result follows by approximating 
the value function, starting with ue as the first approximation, by Bermudan 
options, which preserve convexity by [U Theorem 5.1] with the needed condi¬ 
tion ^ for the theorem to hold ensured by Proposition 


3.6 


2. From Proposition 3.6, the dispersion if is non-increasing in t, so the claim follows 
by the Bermudan approximation argument for the value function and Theorem 6.1 
in p. 


3. First, let I G and we will show that there exists a constant K > d such that, 
for every t G [0, T], the map x i—)■ v{t,x) is iL-Lipschitz continuous on (—oo,/] H 
Assume for a contradiction that there is no such K. Recall that convexity of a single¬ 
variable function implies continuity and existence of one-sided derivatives. Hence 
using a characterisation of convexity saying that a real-valued function / defined on 
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an interval is convex if and only if the function {xi,X 2 ) e-)> (/(X 2 ) — f{xi))/{x 2 — xi) 
is increasing in both xi and X 2 , we obtain that there is a sequence {tn\n>o C [0,r] 
such that the sequence of left-derivatives diverges to 00 . However, taking 

e G (0,sup/^ — 1), this would imply that v{tn,l -|- e) —)• 00 , contradicting the fact 
that v{tn, / + e) < ^(0, Z -|- e) < 00 for all n G N. 

To finish the proof of the continuity of u, it suffices to show that v{t,x) is continuous 
in t. To reach a contradiction, assume that t 1 — v{t,xo) is not continuous at t = to 
for some xq- By time-decay, this means that v has a negative jump. 

First consider the case when v{to—,xo) > v{tQ,xo). By Lipschitz continuity in the 
second variable, there exists a rectangle TZ = {Iq — 6, to) x (xq — (5, xq -b 6) with <5 > 0 
such that 


inf v{t,x) > v{to,xo + 6). 

{t,x)£Tl 


(3.14) 


Thus 7^ C C. Let t £ {to — 5, to) and tti := inf{n > 0 : {t + u, ^ TZ}. Then, by 

martingality in the continuation region (see m Appendix D]), 


v{t,xo) = 
< 


Jo° ^t4>0'i“u(t,Xo -b 5)l{t+rT^<to} 


+eQ [e/o“-‘ ^l’:y^<^-v{to, xo + 

< Xo -b S)Q{t + T'ji < to) + + <^) 

->■ v{to,xo + J) 


as t ^ to, which contradicts (3.14). 


Next, consider the case when u(to)a;o) > v{to+,xo). We begin by investigating the 
situation v{to,xo) > v{to+,xo) > 1. By Lipschitz continuity of v in the second 
variable and its decay in time, there exists TZ = {to, to -b e] x [xo — 6, xo + (5] with 
e > 0 and <5 > 0 such that 


u(to,a;o)> sup v{t,x)> inf v{t,x)>l. (3.15) 

{t,x)£n (t,x)en 

In particular, TZ C C and writing tti := inf{u > 0 : {to + u, ^ TZ} we have 


II 

0 

0 


< E*® 

g/o Xo -b 

-bE*® e-^° + 6) a:o + <^)1{' 


< ajo + S)Q{tti < e) + + e, a;o + 5) 

u(to+,a;o + S) 
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as e \ 0, which contradicts (3.15). 


Alternatively, suppose that v{to,xo) > v{to+,xo) 
the second variable, there exists h > 0 such that 


= 1. By Lipschitz continuity in 


inf v{to,x) > v{to+,xo) = 1. 

xG(xo—S,xo) 


(3.16) 


Then {to,T] x (xq — 5,xo) (ZV and so the process hits the stopping region 


immediately, implying that {tQ,XQ — 5/2) G V] this contradicts (3.16). 


4. Existence of a non-decreasing boundary h : [0, T) —)■ [—oo, oo] satisfying C = {(t, x) G 
[0,T) X : X > h{t)} is a direct consequence of the first two parts above, and we 
can then define h{T) = lim^ h{t). Non-positivity of h is clear from the expression 


(3.9), since, for any starting point {t,x) G [0, T) x (0, oo), the strategy of stopping 
at the first time hits 0 gives a value strictly greater than 1. 

To show that h is bounded from below, assume for a contradiction that {0} x 


(— 00 , 00 ) C C. Hence, defining ^ as in (3.4), we know that (—e,0] x M C Q, where 
Q denotes the continuation region for the optimal selling problem started at time 
—e < 0 with the prior Writing to denote the Markovian value function for 
the selling problem from time —e, let —t' G (—e,0) and let a < 0 be such that 
v^{—t',a) < . Now, let x G (— 00 , 0 ), and observe that 


u^{—t',x) < ug(—t', a)P( sup 

V 0<u<t' 

—)■ a) < 1 




] + , a)¥( sup 

/ V 0<u<t' 


xzttu > « 


as X \ — 00 . This gives a contradiction since ug > 1 by definition. As a result, we 
can conclude that h{t) G (—oo,0] for all t G [0,T]. 

For the continuity of h, note that continuity together with time-decay of v imply that 
h is right-continuous with left limits. Assume for a contradiction that h{to—) < h{to) 
for some to G (0,T). Take points xi,X2 with h{to—) < xi < X2 < h{to), let 
X = (xi + X2)/2, and consider the rectangle TZ = {to — 5, to) x (xi,X 2 ) C C for some 
(5 > 0. For t G (to — to), 


v{t, x) = 


(3.17) 


eSo'^^lZu 'i^y{t + m, 

ZQiTn <to - t)v{to - 5,X2) + 

Now, estimating ttz above with the leaving time of TZ for a Brownian motion with 


drift (compare the proof of Proposition 3.8), it is straightforward to check that 


{tti < to — t) = o{to — t). Since X 2 < 0, (3.17) thus implies that v{t,x) < 1 for t 


close to to) which contradicts (3.10). 


The above argument also works to show that h{T—) = 0. 
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5. The proof of (3.12) is along standard lines (e.g. see m Theorem 7.7 in Chapter 2]), 
so we do not include it here. 

Let us next establish the smooth-fit property. Since x i—)■ v(t,x) is non-decreasing, 
it suffices to show that 

j- vit,h{t) + e) - v{t,h{t)) ^ ^ 
eto e ~ 

Without loss of generality, let t = 0. Writing x = h{0), it is enough to show that 

v{t, X -|- e) — v{t, x) = o(e) as e \ 0. 

Denoting the optimal stopping time when starting at the point (0, x -|- e) by r^, we 
have 


VI 



E'Q [e-fo" " '1“] 


= 

EQ[e/o" 




< 

E*® [e-l’o^ 

Jo 


< 

E‘®[e/o" 


Jo 

duf^] 

< 

E^ [r.e^ 


]V2eq^ r'(^o,x+. _ 
Jo 



where the penultimate inequality follows from Jensen’s inequality and the last one 
from Cauchy-Schwartz. Since the boundary h is non-decreasing, with the help of 
Levy’s modulus of continuity theorem as well as the law of the iterated logarithm, 
we see that —)• 0 a.s. as e \ 0. Hence, by the dominated convergence theorem, 

ase\0 


with the dominating function as in the proof of Proposition 3.8 
To complete the proof of smooth-fit, it suffices to show that 


To this end, 


_ j^O x 


= O(e^) as e —0. 


/ /‘Te \ 2 


r 1 


< TE®^ 

/ (xr+^-xr)2du 

\Jo J 


Jo 


< 


< ce^. 


sup 


0<u<T 
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where c is a constant dependent on T, a, and the Lipschitz constant of V'- In the 
above, the first inequality comes from Jensen’s inequality, and the last inequality is 
a standard estimate coming from an application of Gronwall’s inequality combined 
with Doob’s inequality. This finishes the proof of the claim. 

□ 



Figure 2: The value function v{t, x) in 
the case of a normal prior with stan¬ 
dard deviation 7 = 0.5, the market 
2 volatility a = 0 . 2 . 


4 An integral equation for the boundary 


In this section we show that the optimal stopping boundary can be characterised as the 
unique solution of a non-linear integral equation. The proof follows along similar lines as 
in [13] and [20] . 


Theorem 4.1 (Optimal stopping boundary). The stopping boundary h is the unique 
solution to the integral equation 



r I-T .C^t,h(t) j 

eit 

= 1+ f e'Q 



Jt 


// duj^t,h(t) 


<h{s)} 


ds 


(4.1) 


in the class of non-positive continuous functions. 


Proof. An application of Ito’s formula (more precisely, its extension proved in m, which 
can be applied thanks to the monotonicity of h) to v[s,Xs' jeP “ yields 


V 



v(t,X*’^)-h / eft 


£j>t,:.v(r, X^’^) -h X*’^v(r, X 


t,X 


+ Xl^^)d2v{r, X*’") dZ, . 


dr 

(4.2) 
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Let us introduce a localising sequence Tn '■= inf{r > t : > n} A T; it satisfies Tn T 

a.s. as n —)> oo. Since, for all re G N, 


eft^u <^^^l;{r,Xp^)d2v{r,Xp^)dZr 


= 0 , 


from (4.2) we get 


}{T A Tn, + 


rTATn 


xi'"" duj>t,x-« 1 


Letting re —)■ oo, the equation becomes 

rT <>t,x _ 


u(r,X^^)e/‘ =u(La:) + 




'-r ^{Xl’^<h{r)} 


dr. (4.3) 


Here, the left-hand side is obtained by dominated convergence as v{T A Tn, “ 

is dominated by which is integrable; the right-hand side comes from 


monotone convergence. Substitution x = h{t) in (4.3) gives 




g/t’’ duj^t,h{t)l 


<h{r)}_ 


dr, 


which shows that h solves the integral equation (4.1). 


For uniqueness, assume that t i—?• k{t) is another non-positive continuous solution to 


(4.1) and define 




^ ^{xl’"^<k{r)} 


(4.4) 


Using (4.3), (4.4) and the Markov property, the two processes defined for s G [t,T] as 


MJ := v{s, Xl'^)e-!’t / e^*’’ 


{Xk'^^kir)} 


and 


Mf := V 


(s, Xi’^)eft g/; 


{Xl’^<h{r)} 


are easily verified to be Q-martingales. 

Claim 1 : v{t, x) = 1 for x < k{t). 

Let X < k{t) and define 7 ^ := inf{s > 0 : X^'^^ > k{t + s)} A (T — t). Then 


=v{t + jk, 


*+T')c i>t,^ 


d« 


f 

= e-l* 

= 1 


t+ 7 fc 


jt+'Jk 


^i+7*; 


xi'"^ du Yt,x-« 1 

® ^{Xk^<k{r)} 


nxXduj^t x 


du 
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where the second equality follows from (4.1). By optional sampling, 


v{t, x) = = 


which finishes the proof of Claim 1. 
Claim 2: v < v. 


t+7fe 


Suppose X > k{t) and define := inf{s > 0 : < k{t + s)} A (T — t). Then 


= 1 , 


v{t, x) = 

V{t + Tk, Xl'^r„ 

= E''^ 

eT “ du 

< v{t, 

x), 






with the first equality following from the martingality of and the optional sampling 
theorem, the second by the definition of v and (4.1). Combining this with Claim 1, the 
result is obtained. 

Claim 3 : h < k. 

Assume for a contradiction that h{t) > k{t) for some t. Let x = k{t) and define 
7 /i := inf{s > 0 : > h{t + s)} f\{T — t). Then 

0 = v{t, x) — v{t, x) 

= - M,7,J 


f 

eJt 


vt,x 


I 


xi’ du 
i+7h 


{v{t + 7 /^, -v{t + 


T-t^X 


In the first equality above, v{t,x) = 1 by the assumption h{t) > k{t), and v{t,x) = 1 by 
the definition of v and (4.1). The second equality comes from optional sampling. In the 
final expression, the first term is non-negative by Claim 2, the second term (including the 
minus sign in front) is strictly positive by the assumption h{t) > k{t) together with the 
continuity and the non-positivity of k and h. Hence we have obtained a contradiction, 
which proves the claim. 

It follows from (4.3), (4.4), Claim 2, and Claim 3 that v = v. Since v{t,x) > 1 for 
x > h{t), Claim 1 yields that h > k. In view of Claim 3, this finishes the proof. □ 


5 Parameter dependence 

5.1 Dependence of the valne function on the market volatility 

A large volatility a makes the observation process noisy, slowing down the speed of learning 
about the drift. Since the fluctuations are trend-free, the intuition is that the agent should 
benefit from a smaller market volatility a. While a full proof of this intuitive remark 
appears to be challenging, we have the following sufficient condition which guarantees 
monotonicity in a. 
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Theorem 5.1. Assume that the dispersion function if is such that is non¬ 

increasing in a. Then the value V in (2.2) is non-increasing in a. 


Proof. If a'ijj{t,x) is non-increasing in a, then both the drift term and the diffusion term 
of X are non-increasing in a. Therefore, Theorem 6.1 from [H] can be applied to prove 
that the value function v is decreasing in u. □ 


Example Suppose X has a two-point prior distribution /r = (1 — 7r)6i -pndh, where I < h. 
Then a'if{t, x) = {h — x){x — 1), so V is decreasing in a. 


i 2 2 

Example Suppose the prior distribution of X is Then aip{t,x) = ^ 2 ^'^ 2 ^ , 

which is increasing in a. Thus Theorem 5.1 does not apply. 


The difficulty in proving the intuitive conjecture that the initial value V in (2.2) 
should be decreasing in the volatility a lies in the fact that it is not true in general that 
the Markovian value function v in (3.9) is decreasing in a. We can see this in the case 
of a normal prior in Figure The picture depicts the difference between two Markovian 
value functions for the same normal prior with standard deviation 7 = 0.5, but different 
volatilities a. Nevertheless, the same picture shows that at time t = 0, the difference is 
positive, so conforming with our intuitive conjecture. 



Figure 3: The difference uo .2 — 'yo .5 
between two value functions; V 0.2 
and U 0.5 denote the value func¬ 
tions in the cases of the market 
volatility a being equal to 0.2 and 
0.5, respectively. 


As far as the optimal stopping boundaries are concerned, the lack of monotonicity of 
the Markovian value function in the volatility a manifests in that the stopping boundaries 
for different values of a may intersect. An example of this appears in Figure The 
same graph also provides intuition about how the shape of the boundary changes as one 
varies the parameter a. In particular, we get an impression what boundary to expect as 
a approaches zero or grows to infinity. 
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Figure 4: Optimal stopping 
boundaries for different val¬ 
ues of market volatility a in 
the case a normal prior with 
standard deviation 7 = 0.5. 


5.2 Dependence of the valne function on the initial prior 

Theorem 5.2. Assume that and ^2 o,re two prior distributions such that the corre¬ 
sponding volatilities 'tpi and 1/^2 satisfy 'tpi{t,x) < 'tjj 2 {t,x) for all {t,x) G [0,T] x M. Then 
the corresponding Markovian value functions vi and V 2 satisfy vi <V 2 - 


Proof. Again, Theorem 6.1 from [9] can be applied to prove that the value function v is 
increasing in fj. □ 


2 

In the case of the normal prior, the function il){t, x) = is monotonically in¬ 

creasing in the standard deviation 7 of the prior. Hence Theorem 5.2 applies and the 
Markovian value function v increases in 7 . A consequence of this is that optimal stopping 
boundaries are ordered by the size of 7 as shown in Figure 



Figure 5: Optimal stopping 
boundaries for different val¬ 
ues of standard deviation 7 
in the case of a normal prior 
when the market volatility 
a = 0 . 2 . 
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For compactly supported distributions, Theoremoffers a way to construct an upper 
bound for the Markovian value function v. Suppose the prior /r is a compactly supported 
distribution. Since V'/x is bounded, by the two-point prior example on page we can 
find a two-point distribution 77 := (1 — 7 r)( 5 a -|- 7r6b with drt) = /jgttr/(du) such that 

'ipii < Then Theorem 5.2 yields that < Vr^ and so the stopping boundaries satisfy 
hr/ < h^. As a result, ], where denotes the initial value for the prior /r, 

denotes the expectation operator under which the prior is rj instead of //. 


5.3 The value of filtering: numerical investigation 

Having introduced and solved the optimal liquidation problem for an arbitrary prior, a 
pragmatic question arises - how much is there to be gained from the elaborate sequen¬ 
tial liquidation strategy with real-time filtering in comparison to a naive optimal selling 
strategy without filtering? In this part, we address the question within our model from a 
numerical point of view in the normal prior case. 

Let us consider an agent who wants to liquidate an asset before time T. We suppose 


the asset price evolves according to (2.1) and that the agent’s prior ^ for the drift X is the 


normal distribution N{m,'y). If the agent does not know about the possibility of real-time 
filtering, he does not utilise any valuable information from the asset price observations and 
so, at time 0 , will make a decision whether to sell immediately, i.e. at time 0 , or liquidate 
at time T. The expected value from an optimal liquidation strategy with selling allowed 
only at times 0 and T is 

■= E[e^^] V 1 = V 1. 


However, if the agent is aware of an optimal sequential liquidation procedure involving 
filtering, the expected value from an optimal selling is 


V = sup E[iS't-]. 

It is worth noting that the inequality V{o,t} ^ ^ holds for any prior /i, thus giving us a 
lower bound for the value V. 

In Figure we see the two values V calculated for two different priors 

at a range of different market volatilities. In addition. Figure depicts the percentage 
improvement [V — that the sequential procedure with filtering brings over 

the naive strategy. Dependence of the two values on the standard deviation 7 of the 
normal prior is illustrated in Figure 
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Value Improvement in % Value 



Figure 6; Initial value as a 
function of market volatil¬ 
ity. The solid gray curve corre¬ 
sponds to the optimal liquidation 
value and the dashed gray line to 
the value without filtering - both 
for the normal prior A^(—0.1,0.5). 
Similarly, the solid black curve 
corresponds to the optimal liqui¬ 
dation value and the dashed black 
line to the value without filtering 
for the normal prior A^(—0.2,0.5). 



Figure 7; Improvement due 
to filtering. The percentage 
improvement {V — h{o,T})/^{o,r} 
over the strategy without 
filtering. 



Figure 8; Initial value as a 
function of standard devia¬ 
tion of the normal prior. The 

solid curve corresponds to the op¬ 
timal liquidation value while the 
dashed line to the value without 
filtering. Here the prior is nor¬ 
mal with mean —0.05, the market 
volatility a = 0.2. 
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